To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 鸚??肄?????Lh鸚??肄?????L 11101010010111110011111100111111111000111110010100111111001111110011111100111111001111110100110001101000111010100101111100111111001111111110001111100101001111110011111100111111001111110011111101001100 ea5f3f3fe3e53f3f3f3f3f4c68ea5f3f3fe3e53f3f3f3f3f4c
EUC-JP 鸚??肄?????Lh鸚??肄?????L 11110011110000000011111100111111111001101110011100111111001111110011111100111111001111110100110001101000111100111100000000111111001111111110011011100111001111110011111100111111001111110011111101001100 f3c03f3fe6e73f3f3f3f3f4c68f3c03f3fe6e73f3f3f3f3f4c
UTF-8 鸚룸뗄肄쎾ㅇ流곸뒏Lh鸚룸뗄肄쎾ㅇ流곸뒏L 111010011011100010011010111010111010001110111000111010111001011110000100111010001000001010000100111011001000111010111110111000111000010110000111111011111010011110001010111010101011001110111000111010111001001010001111010011000110100011101001101110001001101011101011101000111011100011101011100101111000010011101000100000101000010011101100100011101011111011100011100001011000011111101111101001111000101011101010101100111011100011101011100100101000111101001100 e9b89aeba3b8eb9784e88284ec8ebee38587efa78aeab3b8eb928f4c68e9b89aeba3b8eb9784e88284ec8ebee38587efa78aeab3b8eb928f4c
UHC 鸚룸뗄肄쎾ㅇ流곸뒏Lh鸚룸뗄肄쎾ㅇ流곸뒏L 111001011010010010110111111010111011011010111111111011001011110110011011111001011010010010110111111010101111110010000001111011001000101010001100010011000110100011100101101001001011011111101011101101101011111111101100101111011001101111100101101001001011011111101010111111001000000111101100100010101000110001001100 e5a4b7ebb6bfecbd9be5a4b7eafc81ec8a8c4c68e5a4b7ebb6bfecbd9be5a4b7eafc81ec8a8c4c

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)