To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 堰??意??蹂〓?v堰??意??蹂〓?vB 1000100110000001001111110011111110001000110100110011111100111111111001101111100010000001101011000011111101110110100010011000000100111111001111111000100011010011001111110011111111100110111110001000000110101100001111110111011001000010 89813f3f88d33f3fe6f881ac3f7689813f3f88d33f3fe6f881ac3f7642
EUC-JP 堰??意??蹂〓?v堰??意??蹂〓?vB 1011000111100001001111110011111110110000110101010011111100111111111011001111101010100010101011100011111101110110101100011110000100111111001111111011000011010101001111110011111111101100111110101010001010101110001111110111011001000010 b1e13f3fb0d53f3fecfaa2ae3f76b1e13f3fb0d53f3fecfaa2ae3f7642
UTF-8 堰쇰쑚意뤻뒽蹂〓뼸v堰쇰쑚意뤻뒽蹂〓뼸vB 111001011010000010110000111011001000011110110000111011001001000110011010111001101000010010001111111010111010010010111011111010111001001010111101111010001011100110000010111000111000000010010011111010111011110010111000011101101110010110100000101100001110110010000111101100001110110010010001100110101110011010000100100011111110101110100100101110111110101110010010101111011110100010111001100000101110001110000000100100111110101110111100101110000111011001000010 e5a0b0ec87b0ec919ae6848feba4bbeb92bde8b982e38093ebbcb876e5a0b0ec87b0ec919ae6848feba4bbeb92bde8b982e38093ebbcb87642
UHC 堰쇰쑚意뤻뒽蹂〓뼸v堰쇰쑚意뤻뒽蹂〓뼸vB 111001011110100010111100111010111001110010111001111010111111001010001111111010011000101010110011111010111011001110100001111010111001011010111011011101101110010111101000101111001110101110011100101110011110101111110010100011111110100110001010101100111110101110110011101000011110101110010110101110110111011001000010 e5e8bceb9cb9ebf28fe98ab3ebb3a1eb96bb76e5e8bceb9cb9ebf28fe98ab3ebb3a1eb96bb7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)