To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}v?????????}vB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101110110001111110011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f3f7d7642
SJIS-WIN 瘟??央??言??}v瘟??央??言??}vB 1110000110001001001111110011111110001001100110110011111100111111100011001011111000111111001111110111110101110110111000011000100100111111001111111000100110011011001111110011111110001100101111100011111100111111011111010111011001000010 e1893f3f899b3f3f8cbe3f3f7d76e1893f3f899b3f3f8cbe3f3f7d7642
EUC-JP 瘟??央??言??}v瘟??央??言??}vB 1110000111101001001111110011111110110001111110110011111100111111101110001100000000111111001111110111110101110110111000011110100100111111001111111011000111111011001111110011111110111000110000000011111100111111011111010111011001000010 e1e93f3fb1fb3f3fb8c03f3f7d76e1e93f3fb1fb3f3fb8c03f3f7d7642
UTF-8 瘟룡릍央뉓갬言됪쓳}v瘟룡릍央뉓갬言됪쓳}vB 1110011110011000100111111110101110100011101000011110101110100110100011011110010110100100101011101110101110001001100100111110101010110000101011001110100010101000100000001110101110010000101010101110110010010011101100110111110101110110111001111001100010011111111010111010001110100001111010111010011010001101111001011010010010101110111010111000100110010011111010101011000010101100111010001010100010000000111010111001000010101010111011001001001110110011011111010111011001000010 e7989feba3a1eba68de5a4aeeb8993eab0ace8a880eb90aaec93b37d76e7989feba3a1eba68de5a4aeeb8993eab0ace8a880eb90aaec93b37d7642
UHC 瘟룡릍央뉓갬言됪쓳}v瘟룡릍央뉓갬言됪쓳}vB 1110100010110000101101111110011010111000101011001110010011100111100001111110100010110000101101111110010111101011100010011110011010011101100100010111110101110110111010001011000010110111111001101011100010101100111001001110011110000111111010001011000010110111111001011110101110001001111001101001110110010001011111010111011001000010 e8b0b7e6b8ace4e787e8b0b7e5eb89e69d917d76e8b0b7e6b8ace4e787e8b0b7e5eb89e69d917d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)