To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 筌??泣??碎κ?Lh筌??泣??碎κ?L 1110001010100011001111110011111110001011100000110011111100111111111000011110101010000011110010000011111101001100011010001110001010100011001111110011111110001011100000110011111100111111111000011110101010000011110010000011111101001100 e2a33f3f8b833f3fe1ea83c83f4c68e2a33f3f8b833f3fe1ea83c83f4c
EUC-JP 筌™?泣??碎κ?Lh筌™?泣??碎κ?L 111001001010010110001111101000101110111100111111101101011110001100111111001111111110001011101100101001101100101000111111010011000110100011100100101001011000111110100010111011110011111110110101111000110011111100111111111000101110110010100110110010100011111101001100 e4a58fa2ef3fb5e33f3fe2eca6ca3f4c68e4a58fa2ef3fb5e33f3fe2eca6ca3f4c
UTF-8 筌™뫂泣뺧㎖碎κ킐Lh筌™뫂泣뺧㎖碎κ킐L 11100111101011011000110011100010100001001010001011101011101010111000001011100110101100111010001111101011101110101010011111100011100011101001011011100111101000101000111011001110101110101110110110000010100100000100110001101000111001111010110110001100111000101000010010100010111010111010101110000010111001101011001110100011111010111011101010100111111000111000111010010110111001111010001010001110110011101011101011101101100000101001000001001100 e7ad8ce284a2ebab82e6b3a3ebbaa7e38e96e7a28ecebaed82904c68e7ad8ce284a2ebab82e6b3a3ebbaa7e38e96e7a28ecebaed82904c
UHC 筌™뫂泣뺧㎖碎κ킐Lh筌™뫂泣뺧㎖碎κ킐L 111011111010011110100010111000101001000110100110111010111110100010010101111011111010011110100010111000011110111110100101111010101011010010011100010011000110100011101111101001111010001011100010100100011010011011101011111010001001010111101111101001111010001011100001111011111010010111101010101101001001110001001100 efa7a2e291a6ebe895efa7a2e1efa5eab49c4c68efa7a2e291a6ebe895efa7a2e1efa5eab49c4c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)