To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????v???????????vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 鄒カ鬧育オり京繝ッ蠕党v鄒カ鬧育オり京繝ッ蠕党vB 1110011110111110101101101110100110100111100010001110011110110101100000101110100010001011100111101110001110000011101011111110010110111110100100110111110101110110111001111011111010110110111010011010011110001000111001111011010110000010111010001000101110011110111000111000001110101111111001011011111010010011011111010111011001000010 e7beb6e9a788e7b582e88b9ee383afe5be937d76e7beb6e9a788e7b582e88b9ee383afe5be937d7642
EUC-JP 鄒カ鬧育オり京繝ッ蠕党v鄒カ鬧育オり京繝ッ蠕党vB 1110111011000000100011101011011011110010101010011011000011101001100011101011010110100100111010101011010111111110111001011110001110001110101011111110101011000000110001011101111001110110111011101100000010001110101101101111001010101001101100001110100110001110101101011010010011101010101101011111111011100101111000111000111010101111111010101100000011000101110111100111011001000010 eec08eb6f2a9b0e98eb5a4eab5fee5e38eafeac0c5de76eec08eb6f2a9b0e98eb5a4eab5fee5e38eafeac0c5de7642
UTF-8 鄒カ鬧育オり京繝ッ蠕党v鄒カ鬧育オり京繝ッ蠕党vB 111010011000010010010010111011111011110110110110111010011010110010100111111010001000001010110010111011111011110110110101111000111000001010001010111001001011101010101100111001111011100110011101111011111011110110101111111010001010000010010101111001011000010110011010011101101110100110000100100100101110111110111101101101101110100110101100101001111110100010000010101100101110111110111101101101011110001110000010100010101110010010111010101011001110011110111001100111011110111110111101101011111110100010100000100101011110010110000101100110100111011001000010 e98492efbdb6e9aca7e882b2efbdb5e3828ae4baace7b99defbdafe8a095e5859a76e98492efbdb6e9aca7e882b2efbdb5e3828ae4baace7b99defbdafe8a095e5859a7642
UHC 鄒?鬧育?り京????v鄒?鬧育?り京????vB 1111010111011011001111111101011110100010111010111100000000111111101010101110101011001100110010000011111100111111001111110011111101110110111101011101101100111111110101111010001011101011110000000011111110101010111010101100110011001000001111110011111100111111001111110111011001000010 f5db3fd7a2ebc03faaeaccc83f3f3f3f76f5db3fd7a2ebc03faaeaccc83f3f3f3f7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)